Search | BVS Regional Portal

Growing and cultivating the forest genomics database, TreeGenes.

Falk, Taylor; Herndon, Nic; Grau, Emily; Buehler, Sean; Richter, Peter; Zaman, Sumaira; Baker, Eliza M; Ramnath, Risharde; Ficklin, Stephen; Staton, Margaret; Feltus, Frank A; Jung, Sook; Main, Doreen; Wegrzyn, Jill L.

Database (Oxford) ; 2019, 2019 01 01.

Article in English | MEDLINE | ID: mdl-30865259

Growing and cultivating the forest genomics database, TreeGenes.

Database (Oxford) ; 2018: 1-11, 2018 01 01.

Article in English | MEDLINE | ID: mdl-30239664

ABSTRACT

Forest trees are valued sources of pulp, timber and biofuels, and serve a role in carbon sequestration, biodiversity maintenance and watershed stability. Examining the relationships among genetic, phenotypic and environmental factors for these species provides insight on the areas of concern for breeders and researchers alike. The TreeGenes database is a web-based repository that is home to 1790 tree species and over 1500 registered users. The database provides a curated archive for high-throughput genomics, including reference genomes, transcriptomes, genetic maps and variant data. These resources are paired with extensive phenotypic information and environmental layers. TreeGenes recently migrated to Tripal, an integrated and open-source database schema and content management system. This migration enabled developments focused on data exchange, data transfer and improved analytical capacity, as well as providing TreeGenes the opportunity to communicate with the following partner databases: Hardwood Genomics Web, Genome Database for Rosaceae, and the Citrus Genome Database. Recent development in TreeGenes has focused on coordinating information for georeferenced accessions, including metadata acquisition and ontological frameworks, to improve integration across studies combining genetic, phenotypic and environmental data. This focus was paired with the development of tools to enable comparative genomics and data visualization. By combining advanced data importers, relevant metadata standards and integrated analytical frameworks, TreeGenes provides a platform for researchers to store, submit and analyze forest tree data.

Subjects

Genetic Databases , Forests , Genomics , Data Mining , Gene Ontology , Phenotype , Phylogeny , Search Engine , Software , Trees/genetics , Trees/growth & development

Unique features of the loblolly pine (Pinus taeda L.) megagenome revealed through sequence annotation.

Wegrzyn, Jill L; Liechty, John D; Stevens, Kristian A; Wu, Le-Shin; Loopstra, Carol A; Vasquez-Gross, Hans A; Dougherty, William M; Lin, Brian Y; Zieve, Jacob J; Martínez-García, Pedro J; Holt, Carson; Yandell, Mark; Zimin, Aleksey V; Yorke, James A; Crepeau, Marc W; Puiu, Daniela; Salzberg, Steven L; Dejong, Pieter J; Mockaitis, Keithanne; Main, Doreen; Langley, Charles H; Neale, David B.

Genetics ; 196(3): 891-909, 2014 Mar.

Article in English | MEDLINE | ID: mdl-24653211

ABSTRACT

The largest genus in the conifer family Pinaceae is Pinus, with over 100 species. The size and complexity of their genomes (~20-40 Gb, 2n = 24) have delayed the arrival of a well-annotated reference sequence. In this study, we present the annotation of the first whole-genome shotgun assembly of loblolly pine (Pinus taeda L.), which comprises 20.1 Gb of sequence. The MAKER-P annotation pipeline combined evidence-based alignments and ab initio predictions to generate 50,172 gene models, of which 15,653 are classified as high confidence. Clustering these gene models with 13 other plant species resulted in 20,646 gene families, of which 1554 are predicted to be unique to conifers. Among the conifer gene families, 159 are composed exclusively of loblolly pine members. The gene models for loblolly pine have the highest median and mean intron lengths of 24 fully sequenced plant genomes. Conifer genomes are full of repetitive DNA, with the most significant contributions from long-terminal-repeat retrotransposons. In depth analysis of the tandem and interspersed repetitive content yielded a combined estimate of 82%.

Subjects

Plant Genome , Molecular Sequence Annotation/methods , Pinus taeda/genetics , Plant DNA/analysis , Molecular Evolution , Plant Genes , Multigene Family , Phylogeny , Sequence Alignment

Decoding the massive genome of loblolly pine using haploid DNA and novel assembly strategies.

Neale, David B; Wegrzyn, Jill L; Stevens, Kristian A; Zimin, Aleksey V; Puiu, Daniela; Crepeau, Marc W; Cardeno, Charis; Koriabine, Maxim; Holtz-Morris, Ann E; Liechty, John D; Martínez-García, Pedro J; Vasquez-Gross, Hans A; Lin, Brian Y; Zieve, Jacob J; Dougherty, William M; Fuentes-Soriano, Sara; Wu, Le-Shin; Gilbert, Don; Marçais, Guillaume; Roberts, Michael; Holt, Carson; Yandell, Mark; Davis, John M; Smith, Katherine E; Dean, Jeffrey F D; Lorenz, W Walter; Whetten, Ross W; Sederoff, Ronald; Wheeler, Nicholas; McGuire, Patrick E; Main, Doreen; Loopstra, Carol A; Mockaitis, Keithanne; deJong, Pieter J; Yorke, James A; Salzberg, Steven L; Langley, Charles H.

Genome Biol ; 15(3): R59, 2014 Mar 04.

Article in English | MEDLINE | ID: mdl-24647006

ABSTRACT

BACKGROUND: The size and complexity of conifer genomes has, until now, prevented full genome sequencing and assembly. The large research community and economic importance of loblolly pine, Pinus taeda L., made it an early candidate for reference sequence determination. RESULTS: We develop a novel strategy to sequence the genome of loblolly pine that combines unique aspects of pine reproductive biology and genome assembly methodology. We use a whole genome shotgun approach relying primarily on next generation sequence generated from a single haploid seed megagametophyte from a loblolly pine tree, 20-1010, that has been used in industrial forest tree breeding. The resulting sequence and assembly was used to generate a draft genome spanning 23.2 Gbp and containing 20.1 Gbp with an N50 scaffold size of 66.9 kbp, making it a significant improvement over available conifer genomes. The long scaffold lengths allow the annotation of 50,172 gene models with intron lengths averaging over 2.7 kbp and sometimes exceeding 100 kbp in length. Analysis of orthologous gene sets identifies gene families that may be unique to conifers. We further characterize and expand the existing repeat library based on the de novo analysis of the repetitive content, estimated to encompass 82% of the genome. CONCLUSIONS: In addition to its value as a resource for researchers and breeders, the loblolly pine genome sequence and assembly reported here demonstrates a novel approach to sequencing the large and complex genomes of this important group of plants that can now be widely applied.

Subjects

Contig Mapping/methods , Plant Genome , Pinus taeda/genetics , DNA Sequence Analysis/methods , Plant DNA/genetics , Haploidy

Tripal: a construction toolkit for online genome databases.

Ficklin, Stephen P; Sanderson, Lacey-Anne; Cheng, Chun-Huai; Staton, Margaret E; Lee, Taein; Cho, Il-Hyung; Jung, Sook; Bett, Kirstin E; Main, Doreen.

Database (Oxford) ; 2011: bar044, 2011.

Article in English | MEDLINE | ID: mdl-21959868

ABSTRACT

As the availability, affordability and magnitude of genomics and genetics research increases so does the need to provide online access to resulting data and analyses. Availability of a tailored online database is the desire for many investigators or research communities; however, managing the Information Technology infrastructure needed to create such a database can be an undesired distraction from primary research or potentially cost prohibitive. Tripal provides simplified site development by merging the power of Drupal, a popular web Content Management System with that of Chado, a community-derived database schema for storage of genomic, genetic and other related biological data. Tripal provides an interface that extends the content management features of Drupal to the data housed in Chado. Furthermore, Tripal provides a web-based Chado installer, genomic data loaders, web-based editing of data for organisms, genomic features, biological libraries, controlled vocabularies and stock collections. Also available are Tripal extensions that support loading and visualizations of NCBI BLAST, InterPro, Kyoto Encyclopedia of Genes and Genomes and Gene Ontology analyses, as well as an extension that provides integration of Tripal with GBrowse, a popular GMOD tool. An Application Programming Interface is available to allow creation of custom extensions by site developers, and the look-and-feel of the site is completely customizable through Drupal-based PHP template files. Addition of non-biological content and user-management is afforded through Drupal. Tripal is an open source and freely available software package found at http://tripal.sourceforge.net.

Subjects

Computational Biology , Database Management Systems , Genetic Databases , Genome , Internet , Data Mining

Complete genome of the onion pathogen Enterobacter cloacae EcWSU1.

Humann, Jodi L; Wildung, Mark; Cheng, Chun-Huai; Lee, Taein; Stewart, Jane E; Drew, Jennifer C; Triplett, Eric W; Main, Doreen; Schroeder, Brenda K.

Stand Genomic Sci ; 5(3): 279-86, 2011 Dec 31.

Article in English | MEDLINE | ID: mdl-22675579

Candidate gene database and transcript map for peach, a model species for fruit trees.

Horn, Renate; Lecouls, Anne-Claire; Callahan, Ann; Dandekar, Abhaya; Garay, Lilibeth; McCord, Per; Howad, Werner; Chan, Helen; Verde, Ignazio; Main, Doreen; Jung, Sook; Georgi, Laura; Forrest, Sam; Mook, Jennifer; Zhebentyayeva, Tatyana; Yu, Yeisoo; Kim, Hye Ran; Jesudurai, Christopher; Sosinski, Bryon; Arús, Pere; Baird, Vance; Parfitt, Dan; Reighard, Gregory; Scorza, Ralph; Tomkins, Jeffrey; Wing, Rod; Abbott, Albert Glenn.

Theor Appl Genet ; 110(8): 1419-28, 2005 May.

Article in English | MEDLINE | ID: mdl-15846479

ABSTRACT

Peach (Prunus persica) is a model species for the Rosaceae, which includes a number of economically important fruit tree species. To develop an extensive Prunus expressed sequence tag (EST) database for identifying and cloning the genes important to fruit and tree development, we generated 9,984 high-quality ESTs from a peach cDNA library of developing fruit mesocarp. After assembly and annotation, a putative peach unigene set consisting of 3,842 ESTs was defined. Gene ontology (GO) classification was assigned based on the annotation of the single "best hit" match against the Swiss-Prot database. No significant homology could be found in the GenBank nr databases for 24.3% of the sequences. Using core markers from the general Prunus genetic map, we anchored bacterial artificial chromosome (BAC) clones on the genetic map, thereby providing a framework for the construction of a physical and transcript map. A transcript map was developed by hybridizing 1,236 ESTs from the putative peach unigene set and an additional 68 peach cDNA clones against the peach BAC library. Hybridizing ESTs to genetically anchored BACs immediately localized 11.2% of the ESTs on the genetic map. ESTs showed a clustering of expressed genes in defined regions of the linkage groups. [The data were built into a regularly updated Genome Database for Rosaceae (GDR), available at (http://www.genome.clemson.edu/gdr/).].

Subjects

Chromosome Mapping , Genetic Databases , Expressed Sequence Tags , Plant Genome , Prunus/genetics , Breeding/methods , Bacterial Artificial Chromosomes , Gene Library , Plasmids/genetics , DNA Sequence Analysis
